Asymmetric Cheeger cut and application to multi-class unsupervised clustering

نویسنده

  • Xavier Bresson
چکیده

Cheeger cut has recently been shown to provide excellent classification results for two classes. Whereas the classical Cheeger cut favors a 50-50 partition of the graph, we present here an asymmetric variant of the Cheeger cut which favors, for example, a 10-90 partition. This asymmetric Cheeger cut provides a powerful tool for unsupervised multi-class partitioning. We use it in recursive bipartitioning to detach one after the other each of the classes. This asymmetric recursive algorithm handles equally well any number of classes, as opposed to symmetric recursive bipartitioning which is naturally better suited for 2 classes. We obtain an error classification rate of 2.35% and 4.07% for MNIST and USPS benchmark datasets respectively, drastically improving the former 11.7% and 13% error rate obtained in the literature with symmetric Cheeger cut bipartitioning algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring

In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...

متن کامل

Cheeger constants, structural balance, and spectral clustering analysis for signed graphs

We introduce a family of multi-way Cheeger-type constants {h k , k = 1, 2, . . . , N} on a signed graph Γ = (G, σ) such that h k = 0 if and only if Γ has k balanced connected components. These constants are switching invariant and bring together in a unified viewpoint a number of important graph-theoretical concepts, including the classical Cheeger constant, the non-bipartiteness parameter of D...

متن کامل

Convergence and Energy Landscape for Cheeger Cut Clustering

This paper provides both theoretical and algorithmic results for the `1-relaxation of the Cheeger cut problem. The `2-relaxation, known as spectral clustering, only loosely relates to the Cheeger cut; however, it is convex and leads to a simple optimization problem. The `1-relaxation, in contrast, is non-convex but is provably equivalent to the original problem. The `1-relaxation therefore trad...

متن کامل

Modified Cheeger and Ratio Cut Methods Using the Ginzburg-Landau Functional for Classification of High-Dimensional Data

Recent advances in clustering have included continuous relaxations of the Cheeger cut problem and those which address its linear approximation using the graph Laplacian. In this paper, we show how to use the graph Laplacian to solve the fully nonlinear Cheeger cut problem, as well as the ratio cut optimization task. Both problems are connected to total variation minimization, and the related Gi...

متن کامل

Oil Reservoirs Classification Using Fuzzy Clustering (RESEARCH NOTE)

Enhanced Oil Recovery (EOR) is a well-known method to increase oil production from oil reservoirs. Applying EOR to a new reservoir is a costly and time consuming process. Incorporating available knowledge of oil reservoirs in the EOR process eliminates these costs and saves operational time and work. This work presents a universal method to apply EOR to reservoirs based on the available data by...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012